AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Full-stack Speech Processing

# Full-stack Speech Processing

Wavlm Base
WavLM is a large-scale self-supervised pre-trained speech model developed by Microsoft, pre-trained on 16kHz sampled speech audio, suitable for full-stack speech processing tasks.
Speech Recognition Transformers English
W
microsoft
28.33k
7
Wavlm Large
WavLM is a large-scale self-supervised speech pre-training model developed by Microsoft, supporting full-stack speech processing tasks and excelling in the SUPERB benchmark.
Speech Recognition Transformers English
W
microsoft
396.53k
74
Wavlm Base Plus
WavLM is a large-scale self-supervised pretrained speech model developed by Microsoft, pretrained on 16kHz sampled speech audio, suitable for various speech processing tasks.
Speech Recognition Transformers English
W
microsoft
673.32k
31
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase